Extension of the sasCIF format and its applications for data processing and deposition
Abstract
Recent advances in small-angle scattering (SAS) experimental facilities and data analysis methods have prompted a dramatic increase in the number of users and of projects conducted, causing an upsurge in the number of objects studied, experimental data available and structural models generated. To organize the data and models and make them accessible to the community, the Task Forces on SAS and hybrid methods for the International Union of Crystallography and the Worldwide Protein Data Bank envisage developing a federated approach to SAS data and model archiving. Within the framework of this approach, the existing databases may exchange information and provide independent but synchronized entries to users. At present, there is no established way of exchanging information between the various SAS databases, which leads to possible duplication and incompatibility of entries and limits the opportunities for data-driven research for SAS users. In this work, a solution is developed to resolve these issues and provide a universal exchange format for the community, based on the widely adopted crystallographic information framework (CIF). The previous version of the sasCIF format, implemented as an extension of the core CIF dictionary, has been available since 2000 to facilitate SAS data exchange between laboratories. The sasCIF format has now been extended to describe comprehensively the necessary experimental information, results and models, including relevant metadata for SAS data analysis and for deposition into a database. Processing tools for these files (sasCIFtools) have been developed, and these are available both as standalone open-source programs and integrated into the SAS Biological Data Bank, allowing the export and import of data entries as sasCIF files. Software modules to save the relevant information directly from beamline data-processing pipelines in sasCIF format have also been developed.
This update of sasCIF and the relevant tools is an important step in standardizing the way SAS data are presented and exchanged, making the results easily accessible to users and further promoting the application of SAS in the structural biology community.
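To make the idea of a CIF-based exchange file more concrete, the following is a minimal sketch of how CIF-style text can be read into Python structures. It is not sasCIFtools and does not use the sasCIF dictionary: the function name `parse_simple_cif`, the sample entry and every tag name beginning with `_example` are hypothetical placeholders chosen for illustration. Only the general CIF conventions are assumed (a `data_` block header, `_tag value` items and `loop_` tables), and multi-line text fields delimited by semicolons are not handled.

```python
import shlex

# Hypothetical sample in CIF-style syntax; the tag names are placeholders,
# not items from the sasCIF dictionary.
SAMPLE = """
data_example_entry
_example.title 'Hypothetical protein in buffer'
_example.wavelength_nm 0.124
loop_
_example_scan.q
_example_scan.intensity
_example_scan.sigma
0.010 152.3 1.9
0.020 140.1 1.7
0.030 121.8 1.5
"""


def _is_keyword(token):
    """True for tokens that start a new item, loop or data block."""
    low = token.lower()
    return token.startswith("_") or low == "loop_" or low.startswith("data_")


def parse_simple_cif(text):
    """Parse a small subset of CIF into {block: {"items": {...}, "loops": [...]}}."""
    # Tokenize line by line; shlex keeps quoted strings together as one value.
    tokens = []
    for line in text.splitlines():
        line = line.strip()
        if line and not line.startswith("#"):
            tokens.extend(shlex.split(line))

    blocks = {}
    block = None
    i = 0
    while i < len(tokens):
        tok = tokens[i]
        if tok.lower().startswith("data_"):
            # Start of a new data block; the name follows the "data_" prefix.
            block = blocks.setdefault(tok[5:], {"items": {}, "loops": []})
            i += 1
            continue
        if block is None:
            raise ValueError("content found before any data_ block")
        if tok.lower() == "loop_":
            i += 1
            # Collect the column tags, then the values that fill the table.
            tags = []
            while i < len(tokens) and tokens[i].startswith("_"):
                tags.append(tokens[i])
                i += 1
            values = []
            while i < len(tokens) and not _is_keyword(tokens[i]):
                values.append(tokens[i])
                i += 1
            if not tags or len(values) % len(tags) != 0:
                raise ValueError("loop_ values do not fill whole rows")
            rows = [dict(zip(tags, values[j:j + len(tags)]))
                    for j in range(0, len(values), len(tags))]
            block["loops"].append(rows)
        elif tok.startswith("_"):
            # Single key-value item: the tag is followed by exactly one value.
            if i + 1 >= len(tokens):
                raise ValueError("missing value for " + tok)
            block["items"][tok] = tokens[i + 1]
            i += 2
        else:
            raise ValueError("unexpected token: " + tok)
    return blocks
```

Calling `parse_simple_cif(SAMPLE)` returns one block named `example_entry` whose `items` hold the two single-value tags and whose `loops` hold one table of three rows. All values are kept as strings, so converting them to numbers is left to the caller.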
Journal title:
Volume 49, Issue
Pages -
Publication date: 2016